lmSubsets: Exact Variable-Subset Selection in Linear Regression for R
Related resources
Group subset selection for linear regression
Two fast group subset selection (GSS) algorithms for the linear regression model are proposed in this paper. GSS finds the best combinations of groups up to a specified size minimising the residual sum of squares. This imposes an l0 constraint on the regression coefficients in a group context. It is a combinatorial optimisation problem with NP complexity. To make the exhaustive search very effi...
An Exact Implicit Enumeration Algorithm for Variable Selection in Multiple Linear Regression Models Using Information Criteria
For large multivariate data sets the data analyst often wants to know the best set of independent regressors to use in a multiple linear regression model. Akaike’s Information Criterion (AIC) is one information criterion calculated in SAS that is used to score a model. For a small number of independent variables p, an explicit enumeration of all possible 2^p models is possible. However, for large ...
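The explicit enumeration mentioned in the snippet above can be illustrated with a minimal sketch. It is not the algorithm of the cited paper (which uses implicit enumeration) and not the lmSubsets implementation; it simply fits all 2^p subsets by ordinary least squares and scores each with a Gaussian AIC, up to an additive constant. The function names `aic_gaussian` and `best_subset_by_aic`, the always-included intercept, and the simulated data are assumptions made for illustration.

```python
from itertools import combinations

import numpy as np


def aic_gaussian(rss, n, k):
    """AIC (up to an additive constant) for a Gaussian linear model
    with k fitted coefficients and residual sum of squares rss."""
    return n * np.log(rss / n) + 2 * k


def best_subset_by_aic(X, y):
    """Brute-force search over all 2^p column subsets of X.

    Returns (best_score, best_subset, n_models_evaluated).
    An intercept column is always included (an assumption of this sketch).
    """
    n, p = X.shape
    best_score, best_subset = np.inf, ()
    n_models = 0
    for size in range(p + 1):
        for subset in combinations(range(p), size):
            # Design matrix: intercept plus the chosen regressors.
            design = np.column_stack([np.ones(n)] + [X[:, j] for j in subset])
            coef, _, _, _ = np.linalg.lstsq(design, y, rcond=None)
            resid = y - design @ coef
            rss = float(resid @ resid)
            score = aic_gaussian(rss, n, design.shape[1])
            n_models += 1
            if score < best_score:
                best_score, best_subset = score, subset
    return best_score, best_subset, n_models


# Simulated data (hypothetical): only columns 0 and 3 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = 1.0 + 3.0 * X[:, 0] - 2.0 * X[:, 3] + rng.normal(scale=0.1, size=200)

score, subset, n_models = best_subset_by_aic(X, y)
print(subset, n_models)
```

The number of fits doubles with every added regressor, which is why exact methods for larger p rely on pruning or implicit enumeration rather than a loop like this one.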
FWDselect: An R Package for Variable Selection in Regression Models
In multiple regression models, when there are a large number (p) of explanatory variables which may or may not be relevant for predicting the response, it is useful to be able to reduce the model. To this end, it is necessary to determine the best subset of q (q ≤ p) predictors which will establish the model with the best prediction capacity. FWDselect package introduces a new forward stepwiseb...
Variable selection in linear regression through adaptive penalty selection
Model selection procedures often use a fixed penalty, such as Mallows’ Cp, to avoid choosing a model which fits a particular data set extremely well. These procedures are often devised to give an unbiased risk estimate when a particular chosen model is used to predict future responses. As a correction for not including the variability induced in model selection, generalized degrees of freedom i...
Alternative Strategies for Variable Selection in Linear Regression Models
1. Introduction. 1.1.1. Variable Selection for Incomplete Data Sets. In statistical practice, many real-life data sets are incomplete for reasons like non-responses or drop-outs. When a data set is incomplete, practitioners frequently resort to a "case-deletion" strategy within which the incomplete cases are excluded from analysis and the complete cases are formed into a reduced rectangular com...
Journal
Journal title: Journal of Statistical Software
Year: 2020
ISSN: 1548-7660
DOI: 10.18637/jss.v093.i03